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Abstract 

The c-Balanced Separator problem is a graph-partitioning problem in which given a 
graph G, one aims to find a cut of minimum size such that both the sides of the cut have 
at least cn vertices. In this paper, we present new directions of progress in the c-Balanced 
Separator problem. More specifically, we propose a family of mathematical programs, that 
depend upon a parameter p > 0, and is an extension of the uniform version of the SDPs proposed 
by Goemans and Linial for this problem. In fact for the case, when p = 1, if one can solve this 
program in polynomial time then simply using the Goemans- Williamson's randomized rounding 
algorithm for Max Cut [TT] will give an 0(l)-factor approximation algorithm for c-Balanced 
Separator improving the best known approximation factor of 0(\/\og n) due to Arora, Rao 
and Vazirani [3]. This family of programs is not convex but one can transform them into so 
called concave programs in which one optimizes a concave function over a convex feasible set. 
It is well known that the optima of such programs lie at one of the extreme points of the feasible 
set |26j . Our main contribution is a combinatorial characterization of some extreme points of 
the feasible set of the mathematical program, for p = 1 case, which to the best of our knowledge 
is the first of its kind. We further demonstrate how this characterization can be used to solve 
the program in a restricted setting. Non-convex programs have recently been investigated by 
Bhaskara and Vijayaraghvan [6] in which they design algorithms for approximating Matrix p- 
norms although their algorithmic techniques are analytical in nature. It is important to note 
that the properties of concave programs allows one to apply techniques due to Hoffman [18] or 
Tuy et al [55] to solve such problems with arbitrary accuracy that, for special forms of concave 
programs, converge in polynomial time. 



*A significant portion of this work was done when the author was a B-Tech, M-Tech dual degree student at 
HT-Kanpur, India 



1 Introduction 



Graph partitioning is a problem of fundamental importance both in practice and theory. Many 
problems belonging to the several areas of computer science namely clustering, PRAM emulation, 
VLSI layout, packet routing in networks can be modeled as partitioning a graph into two or more 
parts ensuring that the number of edges in the cut is "small" . The word "small" doesn't refer to 
finding the min-cut in the graph as it doesn't ensure that the number of vertices in both sides of the 
cut is large. To enforce this balance condition one needs to normalize the cut-size in some sense. 
For the known notions of normalization like conductance, expansion and sparsity, finding optimal 
separators is NP-hard for general graphs. Hence, the objective is to look for efficient approximation 
algorithms. Because of the huge amount of work done to design good approximation algorithm for 
these problems, graph partitioning has become one of the central objects of study in the theory of 
geometric embeddings and random walks. Two fundamental problems which we will focus on are 
Sparsest Cut and Balanced Separator. These graph partitioning problems originally came 
up in the context of multi-commodity flows in which we are given a graph with capacities on the 
edges and a set of pairs of vertices (also called source-destination pairs) each having a demand 
and the aim is to find a cut that minimizes the ratio of capacity of the cut and the total demand 
through the cut. When the demand and capacities are all unit then the problem is called uniform 
and in case of general demands and capacities the problem is called non-uniform. 

1.1 Uniform Version 

The first approximation algorithm for such graph partitioning problems, came out of the study 
of Reimannian manifolds in form of the well known Cheegar's Inequality [8] which says that if 
&(G) is the conductance of the graph and A is the second largest eigenvalue of graph Laplacian 
then 2<3?(G) > A > <&(G) 2 /2. Because of the quadratic factor in the lower bound, the true ap- 
proximation is fljTTjr which in worst case can be O(n) in worst case. The first true approximation 



algorithm for Sparsest Cut and Graph Conductance was designed by Leighton and Rao [22] 

whose approximation factor was O(logn). This also gave an O(logn) pseudo-approximation 
algorithm for c-Balanced Separator. This algorithm is referred to as a pseudo-approximation 
algorithm because instead of returning a c-balanced cut, it returns a c'-balanced cut for some fixed 
d < c whose expansion is at most O(logra) times the optimum expansion of best c-balanced cut. 
Their algorithm was based on an LP framework motivated from the idea of Multi-commodity flows. 
Their main contribution was to derive an approximate max-flow, min-cut theorem corresponding 
to multi-commodity flow problem and the sparsest cut. Subsequently, a number of results were 
discovered which showed that good approximation algorithms exist when one is considering ex- 
treme cases such as the number of edges in the graphs is either very small or very large. In fact, 
it is known that for planar graphs one can find balanced cuts which are twice as optimal [10] and 
for graph with an average degree of J7(n), one can design (1 + e)-factor approximation algorithms 
where e > with running time polynomial in input size [2] (such an algorithm is called a Polyno- 




mial Time Approximation Scheme or PTAS). The approximation factor of O(logn) was improved 
to 0(\/log n) in a breakthrough paper by Arora, Rao and Vazirani [4]. Their algorithm is based 
on semi-definite relaxations of these problems . The techniques and geometric structure theorems 
proved in their paper have subsequently led to breakthroughs in the field of metric embeddings. The 
basic philosophy behind these approximation algorithms is to embed the vertices of the input graph 
in an abstract space and derive a "nice" cut in this space. Recently, following a series of papers 
graph expansion has been related to the Unique Games that ultimately led to sub-exponential 
time algorithms for Unique Games [5]. 

1.2 Non-uniform Version 

The non-uniform version of the cut problems is inextricably linked with low distortion metric em- 
bedding. It is easy to see that cut problems can be framed as optimization over l\ metric which in 
general is NP-Hard. So the incentive is to embed the points in a space on which one can optimize 
efficiently for eg. the t\ metric over which can optimize using SDPs. More specifically, using ideas 
from ARV and the measured descent technique of [14], firstly Lee [16] gave an O(logn) approxi- 
mation algorithm for the non-uniform Sparsest Cut, which was later improved to 0(log 3 / 4 n) by 
Chawla, Gupta and Rache [12]. A major breakthrough came from Arora, Lee and Naor [15] who 
improved this bound to O ( n log log n) almost matching an old lower bound due to Enflo [T7] 
which says that there is an n point metric in l\ which need 0(- v /log n) distortion to be embedded 
into 12- 

1.3 Negative Results 

Graph partitioning problems like Sparsest Cut and Balanced Separator are considered to 
among the few NP-hard problems which have resisted various attempts to prove inapproximability 
results. After the result of ARV, there has been a lot of impetus towards proving lower bounds on 
approximation factors. It has been shown by Ambuhl et al [I] that Sparsest Cut can't have a 
PTAS unless NP-complete problems can be solved in randomized sub-exponential time. Because of 
the strong connections between semi-definite programming and the Unique Games Conjecture 
(UGC) of Khot [20] . inapproximability results are also known which assume UGC More specifically, 
in a breakthrough result, Khot and Vishnoi |21] showed that UGC implies super-constant lower 
bounds on the approximation factor for the non-uniform version of the problems. Lee and Naor 
[24] gave an analytical proof of the result that by exhibiting an n point metric on the Heisenberg 
Group that is of negative type and needs w(l) distortion to be embedded in l\. Recently, it has 
been shown by Cheeger, Kleiner and Naor [13] . that the integrality gap of the non-uniform version 
of the sparsest cut SDP is f^log ^ n). Devanur et al [9] showed that the integrality gap of the 
SDP relaxation of Arora-Rao- Vazirani is ri(loglogn) thereby disproving the original conjecture of 
ARV that the integrality gap of their SDP relaxation is atmost a constant. 



1.4 Non-Convex Programming 

In this paper we work with a form of non-convex programs called Concave Programming. In 
order to define concave programming one first needs to define a concave function. A function 
/ : R d —7- R with domain dom/ is said to be concave if dom/ is convex and for all x, y £ dom/, 
/(Ax + (1 — A)y) > Xf(x) + (1 — A)/(y) for all A e [0—1]. Therefore, / is concave iff — / is a convex 
function. Based on this definition one defines concave programming as a form of mathematical 
programming in which one optimizes a concave function over a convex feasible set. More formally, 
a concave programming problem can be written as [mm x ^c f( x )] where C is a convex set in R d 
and / is a concave function. The following is well known result for concave programming |19j . 

Fact 1. For every concave programming problem there is an extreme point of the convex feasible 
set C which globally minimizes the optimization problem. 

The first algorithm for concave programming was designed by Tuy |27] in a restricted scenario 
when the feasible set is a polytope. A more general case, when the feasible set is convex but not 
necessarily polyhedral, was solved by Horst |19] and subsequently by Hoffman [18] . Tuy and Thai 
|28j . General concave programming is NP-hard as {0, l}-integer programming can be cast as a 
concave program. There has been work towards designing efficient algorithms for some special 
class of concave programming. A comprehensive list of works done in concave programming can 
be found in Vaserstein's homepage |31] , Recently, using analytical techniques Bhaskara and Vija- 
yaraghvan [6] have successfully used non-convex programs to design algorithms for approximating 
matrix p— norms. 

1.5 Our Contributions 

Our main contribution is to initiate the study of combinatorial geometric properties of a non- 
convex relaxation for the - Balanced Separator problem. We show that an efficient solution to our 
proposed program will imply improved an approximation algorithm for this problem. In section 
2, we formally introduce the notions of sparsity and balanced cuts and sketch the Semi-Definite 
relaxation for c-Balanced Separator of ARV. We then start section 4 by introducing a family 
of relaxations for c-Balanced Separator which is generated by a parameter p > and show 
that one can use its solution to design an O(l) -factor approximation algorithm for the problem. 
Our result, although conditional, proposes new directions of progress on this problem and also 
a family of optimization problems which are more powerful than semi-definite programs in the 
context of approximation algorithms. Section 5 and 6 are devoted to find interesting properties on 
the geometry of the feasible region of our program and in section 7 we show how these properties 
can be used to design an efficient algorithm to search over a subset of extreme point called vertices. 
We end the paper with Section 8 in which we present conclusions and future directions. 



2 Problem Definition 



We now formally define the versions of balanced graph partitioning problem that we focus on, in 
this paper. c-Balanced Separator 

Given a graph G = (V,E) with \V\ = n, \E\ = m, the c-Balanced Separator problem is to find 
a c (G) where a c (G) = min E(S,S). 

ScV,cn<\S\<(l~c)n 

Although out techniques can potentially be generalized to Sparsest Cut H and other balanced 
graph partitioning problems. 

2.1 SDP Relaxation for c-Balanced Separator 

Unifying the spectral and the metric based (linear programming) approaches, ARV used the fol- 
lowing SDP relaxation to get an improved (pseudo)-approximation algorithm for the c-Balanced 
Separator. Let us call this program SDPbs, 
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\\vi\\ 2 = 1 \/i 
| 2 + \\vj - v k \\ 2 > \\vi - v k \\ 2 Vi,j,k 



Kj 



\vi — vj\\ 2 > 4c(l — c)n 2 



It is easy to see that this indeed is a vector program (and hence an SDP) and is a relaxation 
for the c-Balanced Separator problem. To show that this is a relaxation we have to show that 
for every cut we can get an assignment of vectors such that all the constraints are satisfied and the 
value of the objective function is the size of the cut. Given a cut (S, S) if one maps all the vertices 
in S to a unit vector n and the vertices in S to — n then the value of the function is indeed the 
cardinality of E(S, S). The main idea behind their algorithm is to show that for any set of vectors 
which satisfy the constraints of the SDP there always exist two disjoint subsets of "large" size such 
that for any two points belonging to different subsets the squared Euclidean distance between them 



is at least y-y==j. The same idea is also used to get an improved approximation algorithm for 

Sparsest Cut in Subsequently, this key idea has crucially been used in various other SDP 
based approximation algorithms and in solving problems related to metric embeddings. 



1 In [3] c-Balanced Separator is defined as the minimum sparsity of c-baianced cuts, we will be working with 
a definition which upto constant factors is equivalent to their definition 

2 Given a graph G = (V, E) with = n, \E\ = m, for each cut (S, S) define sparsity of the cut to be the quantity 
A(S) — ^-j5^Jp- The uniform sparsest cut problem is to find a(G) where 

a(G) = min A(S). 

v ' scv v ' 



3 Non-Convex Relaxation for c-Balanced Separator 



Consider the following family of optimization problems which depend on a parameter p > 0. This 
family is essentially an extension of the semi-definite program proposed by ARV. Throughout the 
paper we will use |.| to represent the I2 norm. Let us call this family of programs Fg S . 

i,j£E 

\\vi\\ 2 = 1 Vi 
\\vi - Vj\\ p + \\vj - v k \\ p > \\vi - v k \\ p Vi,j,k 
^ II Vi - Vj\\ 2 > 4c(l - c)n 2 

i,j£E 

Note that for p = 2 this is the SDP relaxation used by ARV. For p = 1, we are mapping 
the points onto a unit sphere, therefore we do not have to force the additional triangle inequality 
constraint of I2 metric. The same mapping described for SDPbs of the vertices of the graph onto 
the unit sphere allows us to conclude that each program in this family is also a relaxation for 
c-Balanced Separator. In most part of the paper we will be working with the case p = 1. Now 
it is easy to see that if we can solve this program for p = 1, then simply using the randomized 
rounding algorithm of Goemans and Williamson [IT] will give an 0(l)-approximation algorithm for 
the problem. This is because of the fact that the last constraint ensures that a random hyperplane 
will find two sets of large size on both sides with constant probability pQ. Another way to look at 
it is that in this program we are actually embedding the points in an I2 metric which in turn is in 
l\ metric. Therefore we have the following theorem: 

Theorem 1. An efficient algorithm for solving F^ s forp = 1 implies an 0{\)-factor approximation 
algorithm for c-Balanced Separator. 



4 A Concave Programming Formulation 

In this section, we consider the family of optimization problems Fg S proposed above and transform 
it into a concave program. This formulation allows us to use the algorithms which have been 
developed to solve a concave program with arbitrary accuracy. We now write F^ s as a program 
with variables as matrix entries and not as <i-dimensional vectors. The variables in the new program 
are of the form Xij = (v{, Vj). Since all Vi's are unit vectors we can write \\vi — Vj\\ as y/2 — 2(vi,Vj). 
If we consider the matrix X with ij th entry the transformation Zij = (1 — Xij), the new 

optimization problem becomes: 

1 p/2 
mm ^ z ij 

i,j€E 



? P/2 , P/2 > p/2 y . . , 

Z ij + Z jk - Z ik Vi >-?' K 

^2 z v — c (l ~~ c ) n2 
% = Vi 

i-zyo 

where 1 is the matrix with all entries as 1. 

Let us call the above program Fg S . This formulation allows us to prove the following lemma: 
Theorem 2. F^ s is a concave program for < p < 2. 

Proof. See Appendix. □ 



5 Case p = 1 

In this case the our feasibility problem now looks like the following: 

min \ 2 V^J 



Zij > c(l — c)n 2 

Zjj = Vi 
1 - Zh 

where 1 is the matrix with all entries as 1. Since —1 < Xij < 1, < < 2. Let us denote the 
region in W d , which satisfies the last two constraints as V, the triangle inequality constraints as T 
and the "well-separated" constraint as T~L. We will denote by T the feasible region. Also if C is an 
inequality constraint, then C* be the equality constraint corresponding to it. The following is an 
easy observation which follows essentially from the definition. 

Observation 1. PCf. 



5.1 The 3-Dimensional Intuition 

If we just focus our attention to 3- variables and look at the feasible set (with out the positive semi- 
definite constraint), then because of the nature of the triangle inequality constraints the geometry 
of the feasible set looks as shown in the Figure 15.11 Notice that the feasible set is not polyhedral 
but it has one dimensional line segments on its boundary. The shaded region enclosed by the points 
Pi,P2,P3,Pi,P5,P6 arid py, depicts the feasible set. The line segments which are on the boundary of 
this object are piP6, P3P5, P2P4- P§P7, P7Pi and P5P7. Also P1P2, P2P3 and pspi are non-linear arcs 
which are on the boundary of the feasible set. Given this description the following is easy to show. 



Figure 1: The picture in 3-D 



Lemma 1. Let f = x\ + x\ + x% where p < 1/2 be the objective function to be minimized over T , 
then the optimum is achieved at one of extreme points p\ , p2 or P3 . 

6 Combinatorial Geometry of the Feasible Set 

In this section we separately consider the constraints and develop tools to understand the geometry 
of the feasible set which can potentially help us in getting an efficient algorithm to solve the 
feasibility problem. 

Our aim in the sections to follow is to give a tight characterization of the "vertices" of the proposed 
program which are defined as follows: 

Definition 1 (Vertex). A point p £ M. d is called a vertex-set of the feasible set T if p £ T and 
there exists equality constraints C%, C|, . . . C* such that p = f] r k Cj*. 

Definition 2 (Arc). An arc a of J 7 is a closed one- dimensional curve joining two vertices of T 



such that there exists equality constraints , C| , . . . C* suc/i i/iai a = fY C* 

Notice that we can analogously define vertices and arcs corresponding to the regions V,T-L and 
T ■ From the previous section it is clear that some arcs are line segments while others are not. The 
arcs which are line segments we will call them edges. In the subsequent sections we will consider 
the constraints separately. 

6.1 The Triangle Inequality Constraints 

We will now look at the geometric structure posed by the Triangle Inequality Constraints inside 
the [0 — 2] d hypercube and prove some structural results on those. 

Definition 3. Let 1Z denote the region inside [0 — 2] d that is formed by the intersections of the 
constraints Z{j + Zjk > z^i, for all i,j, k. 

Observation 2. The vector is a vertex ofT. 

Observation 3. e(p) is an edge(vertex) ofT iff it is an edge(vertex) oflZ. 

We now take a deviation and first characterize all the symmetric n x n matrices with 0, 1 en- 
tries and main diagonal as which satisfy the triangle inequality constraints. Since such matrices 
represent the adjacency matrix of some graph, we essentially need to characterize all graphs whose 
corresponding matrices satisfy the triangle inequality constraints. As we will see later this char- 
acterization will be helpful in analyzing the geometry of the triangle inequality constraints. But 
before that, we first need to define the following class of graphs: 

Definition 4. A graph G(V, E) is called partial- clique if there exists pairwise disjoint sets S± , S2, ■ ■ ■ S r C 
V, such that G = Ky \ {IJi=i -^Sil where K$ denotes edges in the complete graph on S C V . 

We also define the following which will be of our interest later and subsequently prove a series 
of combinatorial results based on these. 

Definition 5. A partial- clique G = Ky \ {UI=i ^S-} over a vertex set V is called a multi-clique 
ifUUSi = V. 

(In the literature these graphs are popularly known as multi-partite graphs.) For the sake of 
brevity we will assume that the empty graph (V = 0) is a multi-clique. We will also allow, again 
for the sake of simplification, that one can take Si's of cardinality 1. 

Theorem 3. A graph satisfies the triangle inequality constraints if and only if it is a partial- clique. 



Proof. It is easy to see that if the graph is a partial clique then it satisfies the triangle inequality 
constraints. Let Si, S2, ■ ■ ■ , S r be the corresponding set of subsets. Consider any three vertices 
v ii v j7 v k: the following cases may arise: (i) none of of them lies in a subset S, (ii) all three lie in 
distinct S^ , Si 2 , Sj 3 (iii) two of them lie in a set S^ and one lie in Sj 2 (iv) only one of them lies in a 
subset Sjj (v) all lie in the same subset Sj. In all these cases one can easily verify that the triangle 
inequality holds. Hence a partial-clique satisfies the triangle inequality constraints. 
For the converse part consider a graph that satisfies the triangle inequality constraints and assume 
that it is not a partial-clique. Clearly the graph can't be disconnected because in that case one can 
easily find three vertices violating the corresponding triangle inequality. The triangle inequality 
essentially says that for all i,j,k, if two edges ViVj and VjV^ are not present in the graph then 
the edge ViVt should also not be present. Now assume that the complement of the graph has m 
connected components. Consider any one component say H. Let Si C V be the set of vertices 
in the component H with an edge v^j G H. If H has just these two vertices % and Vi 2 then it 
is already a clique. Let vi 3 be a vertex which share an edge with or vi 2 . W.l.g let it be v^. 
Since v^v^ and v^v^ is in the complement, Vi 3 Vi 2 also have to be in the complement, which forms 
a 3-clique. We can repeat the same argument for the next vertex Vj 4 , which will share an edge 
with at least one vertex in {v^, Vi 2 , Vi 3 }, to show that it forms a 4-clique. Repeating this argument 
for all the vertices of the component we can show that if is a clique. The same holds for all the 
components. □ 

Lemma 2. The edges of 1Z are of the form XB where B is a bi- clique on V = {1,2,... , n}. 

Proof. One way is easy to verify. For the other side, Notice that if the intersection of a set of 
equalities actually results into a line then there will be a set of variables Xij such that all of them 
are equal and rest are all zeros (hence there is just one variable). Therefore the line will actually 
be a vector with some entries as A and rest as 0. Since the 0/1 vectors which satisfy the triangle 
inequalities are the partial-cliques such a vector with pass through a partial clique G v . Consider 
the graph G\v which represents a weighted partial-clique with all edges with weight A. Now if such 
a partial clique is not a bi-clique then either one of the two cases are possible: 1. One can find an 
edge {i,j} such that weight of {i,j} is A and there is no pair of the form {j, k} or {i, k'} such that 
w {j,k} = or ^{i,fc'} = 0. 

2. G\ is a multi-clique with A as edge weights. 

If it is the first case let {i,j} and {i±,ji} be the two pairs which have weight A and since both 
have value A, the intersection of the planes which we have chosen implies Zij = z% x j x - This means 
there must be some be some k such that one of Zu + Zjk = Zik, Zu + z^ = zjk and z^ + zjk = z^ is 
chosen and Zj/. = Zi 1 j 1 or z^ = Zi 1 j 1 is implied by the rest of the planes chosen. But in both these 
cases, one of Zik or Zjk is zero which is a contradiction. 

For the second case, let i, j, k be three vertices lying in sets Vi, V2 and V3, whose cliques are removed, 
respectively. Now due to the way equality of two variables is implied by a set of constraints, for 



the edge {i,j} to have the same value as {i,k} there must exist some f G Vi and k' G V3 such 
that the plane containing variable Xiy and %iy is chosen, buth this implies Xj/y = which is a 
contradiction. □ 



Lemma 3. Let T be the set of all 0/1 vectors in the hypercube which satisfy the triangle inequality 
constraints, then T is exactly the set of all 0- dimensional faces oflZ. 

Proof. It is easy to see that the vertices of the cube are re-dimensional 0/1 vectors and edges 
are formed by joining those vertices which have hamming distance 1. Since 1Z is essentially the 
intersection of the Hamming cube with the unbounded polytope corresponding to the triangle 
inequalities, all the vertices of the cube which satisfy the triangle inequality constraints will also 
be the vertices of 7Z. We only need to show that there is no other vertices of 1Z. Since vertices 
form the boundary of the edges, the vertices of 1Z are formed as a result of the intersection of edges 
of the cube H with V or edges of V with H. But we can show that every edge of H intersects 
the boundary of V only at its end points and also that every edge of V intersects the supporting 
planes of rl only at the vertices of T~L hence the vertices of 1Z can only be the vertices of rl. To 
show the first claim let (012, 013, . . . , A, . . . , a^ n _i^ n , a nn ) be an edge of rl in which all a^-'s except 
one are fixed to either or 1 and only one coordinate is varying as A G [0 — 1]. Now consider any 
plane corresponding to the triangle inequality constraints of the form Xij + Xjk = x^. Since there 
is only one co-ordinate in the line this equality can't be satisfied for any < A < 1, and hence 
intersection is only possible when A is either or 1. Based on Lemma ?? it is easy to see that the 
edges intersect the supporting planes of rl only at vertices of rl. □ 

Theorem 4. The line segment joining two vertices u and v oflZis an edge of 1Z if and only if the 
subgraphs of G\ u ^_ci—\) v corresponding to the edges with weights A and 1 — A respectively are both 
bi-cliques. 

Proof, (if part) Let u and v be the vertices of 7Z and G u = Ky \ {|Ji=i } G v = Ky \ {U i=1 KRi } 
(since both are bi-cliques). Recall that by definition for any two vertices Ui and Vj in Si n Rj the 
edge between them is not present. Also the set of edges with weight A will be those which are 
present in G u and not in G v vice versa for edges with weights 1 — A. Let H\ and H\_\ be the 
subgraphs comprising of edges with weights A and 1 — A respectively. Consider H\. Since this 
graph is given to be a bi-clique we can assume it to be H\ = Kyi \ {Kt x U Kt 2 }- We now choose 
hyperplanes such that their intersection gives us G\ u+ ^i_x) v - For every i,j,k G V' such that i,j is 
in some and k is in some Tj 2 , i±,i2 G {1,2} with 7^ Tj 2 choose the hyperplanes Zij + Zjk = Zik 
and z-ij + z^ = Zjk among the set of planes. Note that this implies that Zik = zjk and z^ = 0. As a 
result of selecting these hyperplanes we will get all the variables z%j where {i,j} is an edge in H\ 
to be equal. Take this equal value as A. Also for all {i,j} which are not edge set of H\ will have 
weight 0. Repeat the same exercise of choosing hyperplanes for the subgraph H\_\ but this time 
instead of taking the equal value as A take the value as 1 — A. For all the rest of the edges {i, j} with 
weight 1 choose the hyperplane z^ = 1. To link these values we need to choose some other planes. 



For all k such that {i,j} G H\ and {j, k} G Hi~\ choose the plane Zy + Zjj. = z^. It is now 
easy to verify that the intersection of all these planes indeed gives the line segment Xu + (1 — \)v . 

(only if part) Let u and v be vertices of TZ and the graph Gau+(i-A)u doesn't satisfy above 
mentioned condition. Similar to Lemma ??, it can be verified that in this case it is always the case 
that at least one the following two scenarios will arise: 

1. The subgraph H\ has more than one edge and one can find a pair {i,j} such that wu j} is A (or 
1 — A) and there is no pair of the form {j, k} or {i, k'} such that wujt} = or wu^n = 0. 

2. The subgraph H\ (or H\_\) is a collection of disconnected multi-cliques. 

If it is the first case let {i,j} and j±} be the two pairs which have weight A (w.l.g. assume 
it is A) and since both have value A, the intersection of the planes which we have chosen implies 
Zij = Zi 1 j 1 . This means there must be some be some k such that one of Zij + Zjk = z%k , %ij + Zik = Zjk 
and Zik + Zjk = z%j is chosen and Zjk = Zi 1 j 1 or Zik = Zi 1 j 1 is implied by the rest of the planes chosen. 
But in both these cases, one of z^k or Zjk is zero which is a contradiction. 

For the second case, let {ii,ji} and {^2,^2} be two pairs which are in different multi-cliques but 
w {h,h} = w {n,h}- Therefore, Zi 1 j 1 = zi 2 j 2 must be implied by the chosen set of hyperplanes. But 
from the discussion presented before, such a scenario implies that both and %2ji have to be in 
a connected graph which has to be a biclique. □ 

6.2 Positive Semi-Definite Constraint 

We now investigate the surface defined by the positive semi-definite constraint 1 — Z y 0. From 
Observation 1, the region defined by the this constraint is enclosed inside the region defined by the 
triangle inequalities. In this direction we will prove certain interesting results again relating the 
graphs which some of these matrices correspond to. 

Given a symmetric n x n matrix A with ±1 entries define a new matrix A such that A^j = 1 if 
Aij = — 1 and Ay = if Ay = 1. The matrix A can be treated as the adjacency matrix of a 
graph on vertices {i>i,i>2, • • • v n }. We now prove the following lemma which will be of interest in 
the further discussion. 

Lemma 4. Given a symmetric matrix A = [aij] with ±1 entries, the expression E(xi,X2, . ■ ■ , x n ) = 

n n 

aaxf + 2 aijXiXj is non-negative for all x',- L s G K, iff there exist b\, 62, ... b n G {1, —1} such 

i i<j 

that E can be expressed as (b\Xi + 62X2 + . . . + b n x n ) 2 . 

Proof. Clearly one way is trivial, i.e. if E is of the above form then it must be non-negative. For 
the converse part we have to show that for all expressions E which are not of this form we can find 
some values of Xi's i = 1, 2, . . . , n, for which the value of expression these choice of Xi's becomes 
negative. We will denote x = (x±,X2, • • • , x n ). 

Firstly, it is easy to see that any E that is non-negative for all a^s must have the values of a' u s as 



1 because if any an = — 1 then the expression will be negative for the vector x which is a at the 
i th position and otherwise where a is a non-zero number. Now we would show that if E is not of 
the form (b±xi + 62^2 + • • • + b n x n ) 2 then there always exists a triple i,j, k all three distinct such 
that among aij,ajk,aik either all are -1 or two are 1 and one is -1. It is easy to see that under 
this assumption we will be done as for both these cases we can find an x such that E(x) < 0. If 
it is the first case i.e. all are -l's then take x as the vector with a at the positions i,j,k and 
otherwise. The value of E at this x will be —3a 2 < 0. If it is the other case then w.l.g assume that 
dij = djk = 1 and = — 1. In this case we can choose x which has a at positions i and k and —a 
at position j. Again the value of the expression will be —3a 2 < 0. 

We now have to prove that our assumption is always true. We will prove this by induction on n 
Base: Can easily be verified for n = 4. 

Induction: Assuming the above statement holds for k = n, we have to show it for k = n + 1. 
The above statement can be interpreted in terms of a graph. Given a matrix A := a^, consider 
a weighted clique on n vertices in which weight of an edge is 1 or -1. Therefore every expression 
E represents a clique. If it is of the form {b\X\ + 62^2 + • • • + b n x n ) 2 , then we can partition the 
vertex set of the corresponding graphs into two sets S + and S~ = V \ S + such that weights of all 
edges in E(S + ,S~) will be -1 and all other edges will be 1. Suppose the statement doesn't hold 
for k = n + 1 i.e. there exists an expression which is not of the form (b\xi + 62^2 + • • • + b n x n ) 2 
but no triplet exists which satisfies our condition, i.e. all triples are either 1 or two are 1 and one 
is -1. In such a case, remove one vertex from the set and this property still holds for all triples and 
hence by induction we can assume this new graph can be decomposed into two sets S + and S~ as 
above. Now, if we put the removed vertex back then it is easy to verify that we cannot preserve 
the initial property. □ 

Theorem 5. An n x n symmetric matrix A with ±1 entries is positive semidefinite if and only if 
the graph corresponding to A is a complete bipartite graph on vertices {v%,V2, ■ ■ ■ v n }. 

Proof. If the given matrix A is a positive semi-definite matrix then for all vectors x = (x±, X2, ■ ■ ■ , x n ), 
x T Ax >. Now in general for a symmetric matrix A, x T Ax can be expanded as 



In our case, each = dbl. Now, we can appeal to Lemma 0] to conclude that the above expression 
will be non-negative iff it is of the form (bi^ci + 62^2 + • • • + ^n ;c n) 2 for some frj's G {1, —1}. From the 
proof of Lemma H] the weighted graph can be partitioned into two sets S + and S~ such that edges 
of E(S + , S~) are of weight -1 and rest have weight 1. As per the definition of A edges with weight 
1 are removed and rest have weight 1, which makes the graph corresponding to A, a complete 
bipartite graph. □ 




i<j 



Lemma 5. All points Z of the form XA where A := [ay] corresponds to a bi- clique, are in V for 
X e [0-2]. 



Proof. X := [xij] be the matrix such that X^ = 1 — (Xdij) 2 . Now XA will be in S iff the matrix 
X is positive semi-definite. Since A corresponds to the adjacency matrix of a bi-clique, there will 
exists two disjoint non-empty subsets of V = 1, 2, . . . , n, Si and S2 such that Si = A \ S2 and for 
all i £ Si and j G 52, = 1. Also for all i,j G Si, ciy = 0. and i, j G S2, a^- = 0. Since it is the 
adjacency matrix of a graph an = 0. This implies that x^ = 1 — A for all i G Si and j G £2 an d 
Xij = 1 for all i,j G £1 and i,j G S^- Also Xjj = 1. Now we know that the matrix X will be a PSD 
matrix iff there exists some n, n-dimensional vectors Ui, U2, ■ ■ ■ , u n such that Xij = (ui,Uj). Since 
xu = 1 all of these have to be unit vectors. Since 1 — A takes values in the range [—1, 1]. There 
will always be two vectors ni and 112 on the unit sphere such that (ni,ri2) = 1 — A. Therefore, we 
can choose the vectors ui, . . . ,u n as for all i G Si take Ui = ni and for all i G S2 take Ui = ri2- It 
is easy to verify that for all values of i,j, Xij is indeed {ui,Uj). □ 

The following also is provable similar to Lemma [5] 

Lemma 6. Let B be a partial clique from is obtained by removing k cliques form K n , then all 
vectors corresponding to XB belong to V for A G [0 — A&] where A& G [0 — 2] . 

Lemma 7. An edge e of the cube [0 — 2] d is completely contained inside V or completely outside 
it. 

7 Optimizing Over the Vertices is Easy 

In this section we give a characterization of the vertices and arcs of J- and show how the objective 
function can easily be optimized over the vertices. Let T be the the points of the intersection of 
the hyperplane supporting H (denoted by T~L*) the segments of the form XB where B is a partial 
clique. The vertices A, of J 7 , can be divided into types, 

Type 1: Points formed by the intersection of of H* with the edges of V which we call Ai and 
Type 2: Vertices of the hypercube which satisfy H and V . Let this set be denoted by A2. 

Notice that Ai C F. 

We now prove an interesting result showing that we can infact optimize this objective function 
very efficiently over the set of vertices. The reason is that once we find out the intersection points 
of T~L* with the with the edges of T and subsequently find the points among these which minimizes 
the objective function Y2i jeG V^J - -^ u ^ things become simpler because there aren't many edges 
which intersect with the levels of the objective function. 

More specifically, our result essentially answers the following question: given a connected graph 
G = (V, E) is it possible to efficiently find a partial clique G' on the vertex set V such that if an 
edge {i, j} is present in G then it is also present in G' and if it is not present in G then it is also 
not present in G' . The following theorem says given a connected graph G = (V, E) and a subset of 



edges E' one can easily decide whether there exists a partial clique G* = (V, E*) such that E' C i?* 
and E\E' C E 1 * , where E 1 * is the set of edges in the complement of G* . Before that we state the 
following observation that directly follows from Theorem [3l 

Observation 4. If G = (V, E) is a partial- clique on n vertices that does not contain a fixed set 
of edges E' which forms a connected component then it does not contain the clique defined by the 
vertices induced over E' . 

Theorem 6. Given a connected graph G = (V, E) and a subset of edges E' there exists a unique 
(if any) partial clique G* = (V,E*) such that E' C E* and E \ E' C E* , where E* is the set of 
edges in the complement of G* and that partial- clique can be found efficiently. 

Proof. If such a partial-clique exists then it will be of the form Ky \ {[Jl =0 Ksi} f° r some subsets 
S\, S2, . . . , S r C V for some r. Let C\, Ci, . . . , be the connected components of G' = (V,E\ E') 
with the corresponding vertex sets as Vi , V2, ■ ■ ■ , V}.. Clearly all these vertex sets are pairwise 
disjoint. Using Observation |4] we can infer that each Ky i is not present in the graph. Now we have 
to show that some other clique or a clique that contains some of these cliques is not missing. The 
first possibility is easily ruled out as G is a connected graph and hence any other clique will contain 
at least one edge in E which will violate the condition that E' C E* . Similarly, for the other case as 
well if some other Kyi is removed such that V s C V for some s, then also the same condition will be 
violated. As evident from the proof such a partial-clique (if exists) can be computed efficiently. □ 

8 Conclusion 

In this paper, we propose a well-structured family of programs called concave programming and 
investigate the combinatorial geometric structure of the feasible set of the program and show how 
to possibly use them in the context of graph partitioning problems like c-Balanced Separator. 
This is a major paradigmatic shift to attack these problems. It will of immense use to see whether or 
not such techniques can give us improved approximation factor for other problems. This also gives 
us hope that for many of the problems for which optimal approximation factors are not known one 
can possibly rely upon some "nice" programs which are although not convex but can be potential 
candidates for polynomial time solvability because of their geometric structure. Since this family 
is a new form of mathematical programming that is being used in an approximation algorithm, 
progress both in the direction of hardness and algorithms will provide more insights into the nature 
of these concave programs and can potentially lead us to optimal inapproximability results for 
various graph-partitioning problems Another tempting direction inspired from the recent results on 
the Unique Games [?] is to exploit these combinatorial geometric ideas to design sub-exponential 
time 0(l)-approximation algorithms for the problem. 
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A Appendix 

A.l Proof of Theorem [2] 

Proof. Since z p / 2 is concave for p < 2 for z > 0, and the sum of concave functions is also concave, 
the objective function is clearly concave. For the constraints defining the feasible set, Y2i<j z ij — 
c(l — c)n 2 and zu = are convex. The constraint 1 — Z >z can be shown to be convex as follows: 
Let Z\ and Z2 be two matrices corresponding to the variables z^s which lie in the feasible set. 
Therefore, they satisfy 1 — Z\ >z. and 1 — Z<i >z 0. Now, consider the line segment for AG [0—1] 
\Z\ + (1 — A)Z 2 and the matrix 1 — [XZ\ + (1 — A)Z 2 ). This is positive semidefinite as it can be 
rewritten as A(l — Z\) + (1 — A)(l — Z2) which is a sum of two PSD matrices. 
The only type of constraint left are the triangle inequality constraints. Consider an inequality of this 
type say z^- 2 + z p -^ > z p [ 2 . In general, let us look at the region x r + y r > z r for < r < 1. If r = 1/q 
for q > 1 then this region is same as (x 1//<? + y 1 ^) 9 > z. Let p\ = (xi,yi, z\) and P2 = (^2, 2/2,-^2) 
be two points which lie in this region, i.e. (xi 1 / 9 + yi 1 ^) 9 > z\ and (x2 1 ^ q + y2 1 ^ q ) 9 > ^2- To prove 
the convexity of the region we need to show that for any AG [0 — 1], (Xxi + (1 — A)x2, Xyi + (1 — 
X)y2, \z\ + (1 — A)^) also lies inside the region for all such points p\ and p2- Therefore, we have 

to show Xzi + (1 — X)z2 < I (Xxi + (1 — X)x2) q + (Xy± + (1 — X)y2) q I • Thus we will be done if 



we show 



X(x^ + y^y + (1 - A)0r 2 ^ + y^/if < [{\ Xl + (1 _ \) X2 )\ + (\ yi + (1 _ A )y 2 )i) 



1 1 \ q 

which is equivalent to proving that the function f(x,y) = ixi +y q ) is concave. We will prove 
this by showing that the Hessian of this function is negative-definite for all x, y. We now compute 
the entries of the Hessian matrix. The following calculations are easy to verify, 

21 

dx 

dy 2 V a )V^a-\) dxdy \ q )\ T \^ y \) y \ x \ 




In order to show that the Hessian is negative-definite we have to show that for any a, f3 G M, 
the following expression is always non-positive for all x, y > (for x, y as the derivatives do not 



exist): 
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which is non-positive for all a, (3 This proves that the region x p l 2 + y p l 2 > z p l 2 is a convex set for 
all < p < 2. Hence the intersection of all the triangle inequality constraints is also convex. □ 



